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Abstract — We propose an alternative method for collaborative 
decoding of interleaved Reed-Solomon codes. Simulation results 
for a concatenated coding scheme using polar codes as inner 
codes are included. 



I. Introduction 

Reed-Solomon (RS) codes are used in many applications, 
often implemented in an interleaved form as outer codes in 
concatenated code designs. By combining and interleaving 
several Reed-Solomon codewords, correction of long error 
bursts affecting only a few symbols of the particular underly- 
ing codewords can be achieved. 

The standard decoding procedure consists of decoding each of 
the interleaved codewords separately. In recent years, methods 
have been investigated which try to decode the columns 
of the interleaved code no longer independently but in one 
step, allowing for error correction beyond half the minimum 
distance. 

Our approach is based mainly on the considerations in [ 1 1 and 
(2), involving a significantly smaller linear system of equations 
when compared to the methods using multi-sequence shift- 
register synthesis like in [3|. By choosing a parity check matrix 
of a special form - which we do not have to use explicitly - 
we obtain an error locator polynomial allowing us to correct 
errors up to the minimum distance minus 2. The proposed 
algorithm is of equivalent complexity compared to the multi- 
sequence approach, but can be easily parallelized in order to 
increase time performance. 

II. Reed-Solomon codes 

A (generalized) Reed-Solomon code of length n and di- 
mension k over a finite field F with F| = q elements can be 
defined as follows: 

Definition 1 (Reed-Solomon code). Let v := . . . , v n ) € 

F™ be a vector of n < q different elements of F. Let further 
Vk be the vector space of polynomials over F with degree 
< k. Then a Reed-Solomon code QlZS(q; n, k, v) is the set of 
evaluations at v 

{(p(»i),pW pMeF" : peT k } 

of the polynomials from Vk- 

In this work, we will make use of two special types of 
generalized Reed-Solomon codes: 



Definition 2. A Reed-Solomon-Code QlZS(q;n,k,v) with 
length n = q — 1 and 

v = {a°,a\...,a q - 2 ) 

with a being a primitive element of F will be referred to as 
US(q - 1, k). 

The extended code of length n — q obtained by adding the 
zero element ofW to the vector v oflZS(q — l,k), i.e. 

v = (C^aV 1 ,...,^- 2 ), 

will be called TZS*(q, k). 

It is known that Reed-Solomon encoding can be accom- 
plished by multiplication with Vandermonde matrices. There- 
fore, the following matrix is a possible generator matrix of 

1lS*{q,k): 
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The code lZS*(q,k) has one interesting property which has 
been stated in a more general form in [4, p. 304] and will be 
the foundation of our following considerations: 

Lemma 1. The dual code oflZS*(q, k) is lZS*(q, q — k), i.e. 
the Vandermonde matrix 
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with m := n—k is a possible parity check matrix oflZS* (q, k). 

Proof: As can easily be observed, each element of the ma- 
trix product GH T except for the upper left element (Gif T )n 



2 



takes the form 



9-2 



for some x G F \ {0, 1}. Clearly, 

x q - 1 = x° = 1 
holds for all x G F, x ^ 0. Consequently, we obtain 

9-2 9-1 



i=0 



;.= 1 



and because of x ^ 1, 



follows, (we remark that the discrete Fourier transformation 
is based on the very same argument). The residual matrix 
element an simply consists of a sum of ones which adds 
up to zero: 



(Gif T )u = = 0, 



as the order of the additive group of F is q. 

Therefore, GH T = 0. ■ 

III. Interleaved Reed-Solomon Codes and 
Collaborative Decoding 

By grouping I G N codewords of QlZS(q;n, k,v) to a 
matrix, we obtain a linear code of length (l-n), dimension (l-k) 
and minimum distance like the individual columns (n— fc + 1). 

Definition 3 (Interleaved Reed-Solomon (IRS) code). Given 
a certain Reed-Solomon code C := QlZS(q;n,k 1 v), we 
define an Interleaved Reed-Solomon code IlZS(q; /, n, fc, v) 
of interleaving depth I as the set of (n, l)-matrices 

|A= (aW a( 2 \...,a«) : (a«) T G C , i = 1, . . . , l\, 

each consisting of I column-wise arranged codewords from 
C. In case of C — lZS*(q, k), the resulting IRS code will be 
referred to as XTZS* (q, /, A;). 

Assume that a codeword A G 17ZS*(q, I, k) is transmitted 
over a noisy channel, so that 



A + E G 



with some error matrix E G F nx/ is received at the channel 
output. Let E be a matrix with exactly / G N non-zero rows. 
We denote T the set of indices of these erroneous rows. 
As each column of I1ZS* (q, I, k) is a codeword of TZS* (q, k), 
we could try to decode each of the I columns separately by 



computing a syndrome sequence s (t> :— (s\' J , . . . , s^_ k ) of 



length (n— k) and solving a system of linear equations for each 
i G {1, . . . , 1} (which can be done efficiently by algorithms 
like the well-known Berlekamp-Massey-Algorithm in C(/ 2 )). 
By this means we would be able to correct up to t := Lht^J 
errors per column which corresponds to half of the minimum 
distance of lZS*(q, k). 



For collaborative decoding, we arrange the I syndrome 
sequences of an IRS codeword as columns of a so-called 
syndrome matrix S. The computation can be written formally 
as a matrix multiplication of Y with the parity check matrix 
H of the underlying TZS* -Code: 



S = H Y = H ■ (A + E) = H E = E 



T 



(3) 



with and Ejr denoting the submatrices of H and E 
consisting only of those columns and lines, respectively, whose 
indices lie in T. The last equivalence holds because all other 
rows of E are zero. Note that according to Lemma Q] the 
computation of S consists plainly of polynomial evaluations. 
The syndrome matrix takes the form 



( 

„(!) 



\ S (1) 
\ b n-k 



(2) 
„(2) 



s (2) 



4° \ 
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b n-k/ 



Our decoding algorithm succeeds whenever the following two 
conditions are fulfilled: 

Til The non-zero rows E{ (i G J 7 ) of the received word are 

linear independent. 
%2 / < /max := min{Z,rc-fc-l} holds. 
Actually, Hi and % 2 are both necessary and sufficient for 
successful decoding. We will make some remarks on the linear 
independence condition Hi later in section [VTl 
The square submatrix of H 



t?\ £ F /x/ 

'[/] fc 



(5) 



consisting of the first / rows of is a Vandermonde and 
thus non-singular matrix. Therefore, the following line of K 

fi := (H*) f+l e F > (6) 

(note / < n — k) is an uniquely determined linear combination 
of the first / rows Kj of K : 



f 



(7) 



for some Xj G F, j = 1, . . . , /. Given these coefficients, we 
define a polynomial 



A(x) := x 



(8) 



Due to the special form of H (cf. Lemma [TJ, the ith column 
H l consists of the consecutive powers of Vi G F. Conse- 
quently, 

/ 

= A( Xl ) = x( -^A^r 1 (9) 

2=1 

holds for all i £ J. Since A is a polynomial of degree / = | F\, 
these are obviously the only roots. Therefore, A is the error 
locator polynomial. In the following, we will determine the 
coefficients Xj from eq. ([3J: 

Let ip be the linear mapping defined by Ejr g W' xl : 



p : F-^F' 



x i y x ■ Ejr 



(10) 
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By definition, 



(p(Ki) = St 



i=l,. 



(11) 



By our assumption Hi, Ejr is a matrix of maximum rank 
/. Thus, tp is an injective mapping, and the vectors Si (i — 
1, . . . , /) form a basis of the image of ip. Consequently, the 
(/ + l)th row of S is a linear combination of the former, 
uniquely determined by the coefficients of A: 



/ 



>/+i 



(12) 



These coefficients can now be calculated by performing ele- 
mentary column operations and determining a minimum index 
/ such that the (/ + l)-th row is a linear combination of the 
preceding rows S St of the syndrome matrix S. By 
applying the Gauss-Jordan algorithm to S, we obtain a matrix 
of the form 



S := 



/ 1 






Ai 




1 



A 2 



... 0\ 
... 



V : 



(13) 



'J 



Under the assumption Hi such an index / surely exists as the 
rank of S is smaller than the number of rows (n — k). 
As a consequence, we obtain 

Theorem 1. If Hi and Hi are fulfilled, then f — f and the 
polynomial 

f 



A(x) 



built from the elements Xj from (1131 > is the error locator 
polynomial, i.e. 



A(vi) = & % e T 



holds. 



The set of error locations T can now be determined by 
finding the roots of A(x) using standard methods like the 
Chien search. 

Obviously, in principle only (/ + 1) columns of S are 
necessary in order to determine A, allowing for significant 
reductions in computational complexity. We will dwell on this 
possibility in section IVlTl 

IV. Codeword reconstruction 

Given the (correctly computed) set T of erroneous columns 
of Y , we are now able to reconstruct Ejr and therefore A = 
Y E. The matrix equation 



H< 



(14) 



defines an over-determined system of linear equations consist- 
ing of l-(n — k) equations and I • / unknowns. Since we know 
that (fT4t must have an unique solution and since the rows of 



the (Vandermonde!) matrix are linear independent, we 
can restrict to the smaller system 



s lf] = H lf] E? = K E? 



(15) 



with S[f\ and Hff\ denoting the matrices consisting of the 

first / rows of S and H T , respectively. 

As mentioned before, K is a quadratic - and thus invertible 

- Vandermonde matrix and ( fl3l l actually an interpolation 

problem. 

V. Cyclic and shortened codes 

It is well known that using generalized, non-cyclic Reed- 
Solomon codes leads to an increased computational complex- 
ity of the encoding procedure as well as of the syndrome 
calculations. In our case, by skipping the first element vq = 
of v we obtain a cyclic code of length n = q — 1 which can be 
efficiently treated by use of the discrete Fourier transformation 
(DFT) or a generator polynomial. To preserve the duality 
property of Lemma Q] (which is crucial for our algorithm to 
work), we treat the evaluations at the skipped element - which 
we do not know - as a "dummy" row of the IRS codeword. 
Consequently, we only transmit the rows with indices 2 to q. 

On the receiver side we assume an all-zero first row. 
Therefore, we can again use DFT for the computation of 
the syndromes. Certainly, the correct first row (obtained by 
evaluation of the information polynomials at Vi = 0) will 
most probably not be all-zero. In this case, the error locator 
polynomial A will simply have an additional root at vi = 0, 
causing a shift of the coefficients by one. Otherwise, the 
constant coefficients of the information polynomials are in fact 
equal to zero, which has no effect on A. 
Both the maximum number of errors we are able to correct and 
the (actual used) code length decrease by one; the performance 
remains unchanged. 

Further shortening may be applied by the usual methods in 
order to obtain greater flexibility for code design. 

VI. Failure and error probability 

As shown before, in case of / < / max the success of the 
decoding procedure solely depends on the linear independence 
of the error vectors. If we assume that the Ei are random 
vectors uniformly distributed over F ; \ {0}, the probability for 
the Ei being dependent, i.e. the probability that Hi is not 
fulfilled, can be overbounded for / > 2 by 

1 - q-f 



-(«+!-/) . 



q 



-O+i-/) 



(16) 



1-g-i 

as shown in |1|. Clearly, the decoder certainly fails if the 
number of erroneous columns exceeds / max . Thus, 



/ < 2 
PriJ.I) < {q-H+i-f) 2 </</ n 

1 else 



(17) 



holds as an upper bound for the failure probability under 

assumption of uniformly distributed error vectors. 

If the rows of Ejr are linear dependent, the decoder either is 
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not able to obtain any result or decides for a wrong codeword. 
In the following, we will derive an upper bound for the second 
case: 

Looking again at the linear mapping ip from the considerations 
preceding Theorem Q] there are now some non-zero vectors 
u G W - which represent linear combinations of the rows of 
Ejr - with <p(u) = G F'. As the (now linear dependent) 
error vectors E{ are random vectors, each vector from F^\{0} 
can be assumed to have equal probability to lie in the kernel 
of cp. 

Because of the linear dependence together with H2, the rank 
t of S is certainly smaller than (n — k), and therefore the 
algorithm in any case determines a polynomial A of degree t. 
If this polynomial has exactly t different roots, we will call it a 
t-valid polynomial, in accordance to 0. If A is not i-valid, it is 
obvious that it cannot be the wanted error locator polynomial. 
In this case the algorithm reports a decoding failure without 
taking any decision. The fraction of i-valid polynomials with 
leading coefficient 1 and degree t is given by 

^-Q^^. (18) 

If the algorithm determines a minimal zero combination of 
the first rows of S which define a linear subspace of smaller 
dimension than the rank of S, the failure is also detected. 
Therefore, for an erroneous result the first t < / vectors can 
be assumed to be linear independent while any other error 
vector has to be a linear combination of the preceding. 

Consequently, the probability for obtaining a wrong result 
can be overbounded by 

Pe(f,i) < e (p v (t) ■ (1 - p/(t,o) • n (i9) 

t=2 \ j=t ) 

/-I 

<^P„(f)-fH(H (20) 

t=2 

<J2^i- il - tKf - t} - (2D 

t=2 Z - 

Since the sum is clearly dominated by its last summand, 

Pe(/ '^(Pl)!' r(,+W) (22) 

holds approximately. Note that this probability is already 
included in Pf(f, I). 

Compared with [3|, the probabilities for the decoding to fail 
are equivalent in case of the maximum number of correctable 
errors / = / max = (n — k — 1), but declines slower with 
decreasing /. Still, both Pf and P e fall exponentially with 
increasing interleaving depth I and thus can be designed 
arbitrary small by moderate expense. 

In case of concatenated code designs where the columns 
of an outer IRS code are encoded by an inner block code, 
the probability for a column of an IRS codeword to become 
corrupted is given by the frame error rate p t of the inner code. 
The overall frame error rate (FER) with respect to pi can then 



be analytically determined by 

FER = EU ) ■Pf(t,l)-lt-0--Pi) N - t - (23) 

t=2 ^ ' 

The probability that the decoder decides for a wrong codeword 
is derived in an analogous manner: 

N /N\ 

FER C - E ■ P ^ • P* ■ 0- ~ W 

t=2 \ ' 

Fig. Q] depicts the bounds on the FER as well as on the 
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Fig. 1. Bounds FER and FER C for a (204, 188) IRS code with I = 16 
for collaborative decoding using Gaussian elimination and failure bound for 
collaborative decoding using multi- sequence shift-register synthesis (MSSRS) 

error probability FER C as a function of the inner code error 
rate pi. Here, a (204, 188) shortened RS code like in the 
DVB standard |7| is used. As the corresponding IRS code 
is able to correct up to 15 erroneous columns, we chose an 
interleaving depth of I = 16 rather than I = 12 in the DVB 
standard. Fig. Q] demonstrates that the probability for taking 
a wrong decision should be usually negligible compared to 
the failure probability. The bound considered in the multi- 
sequence approach (cf. f3)) coincides with the FER bound 
derived here. 

We remark that further increasing of the interleaving depth 
I has practically no effect on the failure performance which 
is dominated by the probability that the number of erroneous 
rows exceeds / max - 

VII. Complexity 

As the system of linear equations considered in this paper 
has no Toeplitz structure in contrast to the single decoding 
case, we have to use Gaussian elimination, an algorithm known 
to have cubic time complexity. In our case of rectangular 
matrices the complexity is of order 0{l ■ f 2 ) (with I and / as 
defined before). 

However, only a single system has to be solved whereas there 
are I systems in the independent decoding case as well as in 
10. Moreover, in each step of the Gaussian elimination, the I 
columns of the syndrome matrix S may be transformed at the 
same time by a parallel implementation of the decoder. Such 
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parallelized algorithms for Gaussian elimination are able to 
reduce the time complexity up to 0(7). 
As mentioned before, rather than the whole matrix S only the 
first (/ + 1) columns are actually necessary for determination 
of the error locator polynomial. In the case of / << I 
it is therefore possible to reduce computational complexity, 
applying a decoding algorithm which successively performs 
Gaussian elimination on quadratic submatrices of S of in- 
creasing size whereas in the ith step the (i x i)-submatrix 
consisting of the first i rows and columns of S is considered. 
Decoding stops when a submatrix with linear dependent rows 
is found. The linear combination obtained in this way may 
now be "tested" on a certain number of additional columns in 
order to ensure the desired level of reliability. 

VIII. Simulation results 

Finally, we present some examplary simulations to demon- 
strate the tightness of the derived bounds and to show that the 
randomness of the error vectors can be realized in practice. 
For simulation, a (256,128) polar code as inner code is used 




Fig. 2. Simulated and analytical FER of a concatenated code composed of 
an inner (256, 128) polar code and an outer RS(204,188) code with I = 16 
as a function of inner code FER 

together with the (204,188) IRS code from section [VTI Polar 
codes [5 1, first introduced by E. Ankan, are decoded by a low- 
complexity successive decoder which generates estimations on 
the source bits one after another, each depending on the deci- 
sions made before. In case of a wrong decision, from that bit 
on unpredictable, long error sequences are produced until the 
end of the codeword. This fact (which could usually be seen 
as a drawback) makes polar codes well suited as inner codes 
in our case. However, the polar successive decoder happens 
to fail at certain bits significantly more likely than at other 
ones which prevents the error vectors from being uniformly 
distributed. It turned out be necessary to apply different bit 
permutations for each column of the IRS codeword before 
encoding with the inner polar code in order to randomize the 
error vectors at the RS decoder input. 

Fig. |2] depicts the FER of the concatenated block as a func- 
tion of the word error rate of the inner polar code, trans- 
mitted over the binary-input AWGN channel. The diamond 
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Fig. 3. Simulated and analytical FER performance of a I = 16 concatenated 
code with an outer RS(204,188) code and (a) an inner (256,128) polar code 
with standard decoding, (b) an inner (256,128) polar code with improved 
decoding, (c) an inner rate 1/2 convolutional code with constraint length K = 7 

markers correspond to simulation points where E^/Nq = 
2.85 dB. . .3.15 dB with a step size of 0.1 dB while the 
continuous line demarks the analytically derived bound from 
section [VTJ By using an improved polar decoding scheme as 
considered in [6|, the performance can be further enhanced 
by about 0.3 dB. Simulation results of those improved polar 
codes are represented by circular markers. Fig. [2] shows that 
the simulation points of both decoding schemes match the 
theoretical failure bound. 

In Fig. [5] the FER performance of the two polar coding 
schemes are compared to a concatenation of an inner rate 
1/2 convolutional code (constraint length K = 7) with an 
independently decoded outer (204, 188) RS code as used in 
the DVB standard, but for 1 = 16 instead of 1 = 12. 
Both IRS-polar concatenation schemes clearly outperform 
the DVB code in terms of frame error rates as well as of 
computational complexity. 
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